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FIG. 2 

42 42 GCATGCACTG CCACAAGTAG TGAACTCATG GTTTTACCTC CTCAAGTAGA 

4192 AAACCTTTTG AGTGAATTTG AAGATTTATT CTCCCAAGAA GGACCCATTG 

4142 GGCTTCCTCC TCTTAGGGGG ATAGAACATC AAATTGACTT TATACCGGGG 

4 0 92 GCAAGCCTAC CAAATAGGCC TCCTTATAGA ACCAACCCCG AGGAAACAAA 

4 0 42 GGAGATAGAA TCACAAGTTC AAGACTTGTT GGAGAAGGGT TGGGTTCAAA 

3 9 92 AGAGCCTAAG CCCTTGTGCT GTACCTGTCT TGTTGGTGCC AAAAAAAGAT 

3942 GGAAAATGGC GTATGTGTTG TGATTGTAGA GCAATCAACA ACATCACCAT 

38 92 CAAGTATAGG CATCCAATCC CAAGGCTTGA CGATATGCTT GATGAATTGC 

38 42 ATGGGTCAAC TCTATTCTCC AAAATTGACC TTAAAAGTGG ATATCACCAA 

37 92 ATTCGAATCA AGGAGGGTGA TGAGTGGAAA ACCGCTTTTA AGACCAAATT 

37 42 TGGATTATAT GAGTGGTTGG TGATGCCCTT TGGTCTTACT AACGCTCCAA 

3 692 GTACATTCAT GAGGCTTATG AATCACACCT TGAGGGATTG TATAGGTAAA 

3 642 TATGTAGTAG TTTATTTTGA TGATATCTTA GTATATAGTA AAACCCTAGA 

35 92 AGACCATCTA AGTCACCTTA GGGAAGTTCT TCTAGTTCTT AGGAAAAATA 

3542 GTCTTTTTGC CAATAGGGAT AAGTGTACCT TTTGTGTAGA TAGCGTAGTC 

34 92 TTTTTAGGCT TTATAGTAAA CCAAAAGGGG GTGCATGTAG ATCCCGAGAA 

34 42 AATCAAAGCC ATCCGCGAGT GGCCAACTCC ACAAAATGTA AGTGATGTGA 

33 92 GAAGTTTTCA TGGGTTAGCT AGCTTCTATA GAAGGTTTGT TCCCAATTTT 

3342 TCTAGCCTAG CTTCTCCCTT GAATGAACTT GTAAAAAAAG ATGTTGCATT 

32 92 TTGTTGGAAT GAAAAGCATG AGCAAGCCTT TCAAAGGCTA AAAGCTCACT 

3242 CACCAATGCA CCCATCCTAT CTCTTCCAAA TTTTTCCAAA CTTTTGGAGA 

3192 TAGAGTGTGA TGCATCGGGA GTAGGCATAG TGCGGTTTTG TTGCAAGGTG 

3142 GACACCCCTT GCTTATTTTA GTGAAAAACT CCATGGTGCC ACCCTCACTA 

30 92 CCCCACCTAT GACAAAGACT CTATGCTCTT GTGCGACCCT AAAGACTTGG 

3042 GGAACACTAC CTTGnGTCCC AAAGAATTTG GnTATCCATA GTGATCACGA 

2 992 GTCTTTAAAA TATTTAAAGG GCCAACACAA GCTCAATAAG AGACATGCTA 

2 942 AATGGATGGA ATTTCTTGAA CAATTTCCTT ATGTCATCAA ATACAAGAAA 

28 92 GGGAGCACCA ATATAGTGGC CGATGCTCTT TCTAGACGGC ACACTCTCTT 

2 8 42 TTCAAAACTA GGTGCCCAAA TTCTTGGATT TGACCAC ATA AGAGAGCTTT 

27 92 ATCAAGAAGA TCAAGAACTC TCATCCATCT ATGCCCAATG TCTACATAGA 

27 42 GCACAAGGAG GTTACTATGT GTCCGAGGGA TATCTTTTTA AAGAAGGAAA 

2 692 ACTTTGCATT CCCCAAGGAA CACATAGAAA ACTCCTTGTC AAAGAATCAC 

2 642 ATGAAGGGGG ACTCATGGGC CATTTTGGAG TTGATAAAAC TCTAGACTTT 

25 92 TAAAAGCAAA ATTTTGTTGG CCACACATGA GGAAAGATGT CCACGACATT 

2542 GTCTAGAGTA TCTCATGTTT AAAAGCAAAG TCTAGAACAA TGCCGCTGGA 

24 92 CTCTACACCC CTTTGCCGAT TGCAAAGCTC CTTGTGAAGA CATTAGCATG 

24 42 GATTTCATTT TAGGACTTCC TAGGACTGCA AGAGGCCATG ACTCTATCTT 

23 92 TGTGGTAGTG GACCGTTTTA GCAAAATGTC TCACTTTATT CCATGCCACA 

2342 AAGTAGATGA TGCTCAAAAT ATTTCTAAAC TCTTCTTTAG AGAAGTGGTG 

22 92 AGACTCCATG GTCTCCCTAG AAGTATAGTG TCCGATAGAG ATCACCTTAA 

2242 ATATATAATT ATACACTTGT TTTTTTTCTC TTTTTTATTT TATCAAGTAA 

2192 AAAGTATTTG TTCTAGATTA TTATGAGTAT ATACTTACTT TCTGTATTTC 

2142 ATTTCTTTCT ATTTTTTATG ACGATGAAAT TTCTTATTAT ATCCAGACTT 

2092 TTCATATATA TTTTTATTTC TTTTCCATCT AGATGCTCTG TACTTTTCTT 

2042 CAGTTGAAAT TTCCACTCTC CAACAAAACA TCATTCAAGT TTTGTATAAC 

1992 ACTGTGACGT TAACCAGTTA AAATAAGAAA ATCATGTAAT ATAAATTATT 

194 2 TCAGTAGATA TTTTAGAATT ACAAATACGA TAAATAATTA AATTTAAAAA 

18 92 ATTATTAAAC AATGAATTTT TTTGGAAATT AATATAAAAC TTAGACTTGT 
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GAG GAT TCA GAA GAA GTG TCA GAA GCA GGG AAG GGT CGT TTT GTT 
49EDSEEVSEAGKGRFV 

AGG GAC TTC GAT GTA AAG CCT AAT GTC ATA TTA TAT CAC ACA CAT 
64RDFDVKPNVI LYHTH 

GTT GTG CCC ATG AAG CAG AGG CAG AAG AAT AAA GAT TGA 
79VVPMKQRQKNKD 

4 93 AGACTATGTGATTGGCAGTTTCAGACTTATTTGGCACCAAATTTATGATGCTCTTGTTGC 

553 TGTTTCAAAATTTGTACTCAAACTTTG7VACCCTTTGCAGCATCTTGCTTCTTTTTGGTCT 

613 TGCTGAATTTTGTCACAGTTATACTGTCACGAATAGTTTCTCTTCATAATAAGCAACTTT 

673 TCCTCTC 



FIG. 3 

101001 CAAAACAAAAGCAAATGCCGGTTTTCTTATTATTATTTCGAACTTTAGAC 
100151 CTTTTTGTAACGTTTCTTTAATTTTTTTCCTTGATAAAGAACCCTATTAT 
100201 ATCTTAGCTAAATATTTACCTCATTTTGTTTATGAGCTAAACCACCCCAA 
100251 AAATATTGTAGTTTTGCTTTCGGATTTAACTGCCAAGCAAGTGATTAGAT 
100301 AT ATTAAAGGAAAATGA ATGAAAGGACAAAAAAATATAAACGACAATATT 
100351 TGAATACTGATATTTATCTCCATTCTCAAATATTTTTGATTTATTGTGAC 
1004 01 AATATTTGGTTGTTTCCCATTTGCTACATCTTTGAGGACATGAAATGATA 
1004 51 ACATATATATGAACGAGTATAATACATTCTCGTTTCATTTTACAAATAAT 
100501 GTCAATTTATGCTAACATTTTTTATTTAAAAATTATCCTTATAAGATTTC 
100551 AGTGTATTATTTTACCATGGTACTGTAAAGTCGGATGCTATATATATATA 
100 601 TATATATATATATATCAAAAATGACACTGAAGAATTTATTTGAACTAAAA 
100 651 CTAAAAACGTAAAATAAAAAGAATTTTTCAAAAATCAAAAATTTTATATA 
1007 01 AAAATATAGATAAAATGTTAATATAGTACAACTTCTATTCAAACAGAGAG 

1007 51 AATAAATCTTCTATAGACAGTGAATATCCATTATAATAACGAGCAATAGT 
100801 TGTAATGTTGCAGTACAAAAAGAGAATTGTAATATTTGTGCATGATTGAG 

1008 51 AAATCTAAGTTGACTTTGAATTAAAAGGCT7\ATTCCAACAAGTACATGTA 
100 901 GAAGTTGACTATAGCTATATATTTACTACAAATTGATCATTTCAAGAAAG 
100 951 ACATTTAAATTAAGATATGCATGCATGACTTGATTGAACCCCACTCGCTT 
101001 GCTTCGTGCCATTCGACAAGATGTTACTTTTAAATGCAAGGTAAATTATG 
101051 GATATACTCTTCTGTATTTTTTGTAGTAGATATTTTTACGAAAATTGTTT 
101101 TTTTTCCAAAATCAAATGATATTTATTAATTTTCAATATAGAATTAATTA 
101151 AATTTTAATTAATTTTGAAGATTTATATGCTGC AGATTAGATTACCATTG 
101201 GTGAAATCATGTTTAGGTAAATAATAAATGATGTTGTAGTTTAGGAAAAA 
1012 51 AAAAAATTCTTTAATCTTTATGTAAGAATGTTAAACTTCAATTATAAAAA 
101301 TATGAAGCAGTATTATATAAGATGTTTAACTAATCGAATAATATTTTTTG 
101351 GGATGAAATTTTCTTGCATATGTTTCTAAAAAAATAATATGTGAAAAATT 
1014 01 AACATTCATTGTATGTTTATAAGAAATATATGTGAGTTTTGTTTAGATAA 
1014 51 ATAATACTTAAAATTAAGAATTTGTAAAGTTATACTGCACTTCAAATATG 
101501 TTATTTTTTCCTTTTATTTAAAATATCAGCAACATTCTAAATGATTTTAT 
101551 TTTCTTTAAAAAATTGAAAAAATGAAATTAGCAAATATGTAAAATTTAAA 
101601 ACGAATTTAAGAAAAAACTTTGTAAAGATATGATATGCTTTATAAAAAAA 
101651 ACTTGGTGGCGTACCTACTAAATATGATCACATTAGAGATTTGTATCCTT 
1017 01 TAGCATATAGTATGTAGTATAGATATCTATATTTTTATTTATTAAAGAGC 

1017 51 ATATTCATAATATAGGTATTATATGTTAATTACAATAAACGTTCAATTCG 

1018 01 TTATGTTAGTTTTTAGAAAACTTATTGCGTGTGCATATCAATGTGAGAAA 
1018 51 GCGACTCCACATGTGAGATGTTGGTCTGAGAAAGCTTTCTGCACTTGGTC 
101901 GGAACTACTTCATGGACTAGAATGCAATCCATCTATTCAAAGAAAAGCAG 
101951 TTGTCCATGCATGCCTCGGTTTTTCACATTTGGAAGCAGCGCAACAATGT 
102 001 CTTACATAATATGCGATCGATCACTCTGCAACCAATATTCAAGTACATAG 
102051 ACCATGACATCAAAAACATTATCACACCGAGAAGAAAGAAACGTCAATTT 
102101 GGTAACTTAATGGCGTTATGCCTGCGGTGAATTCTCCTAAGAGTTCT.CCC 
102151 AAATTTTATTGATTCCTTGTTTTTAACTTTTTCGCCAAAGAATCATACAT 
1022 01 ATAGATTTGACACCATTTCAACTTATCAAATACAAGTGAATAAATAATTT 
1022 51 CAAGCTTGAAAGGAATTTAATCATGATCTAAACCTAAACGACAAATTCTT 
102301 CACAAGTGAGAATCACTAATTGACTACCCCTTGGTCGCATATACATCATT 
102 351 GTTGTAAATCTGAAAATTGGTTTGGATTTGATCTGATATGTCATTCATAT 
102401 AAAACTTGTATTATTTATTTTAGAATTTTGCCGCAAACAGATAAATCATC 
102 4 51 ATCTATTTAGAAAATTTTCATTTGCACCACAATTAATCAGGGGAAAAGGT 



FIG. 3 (Continued) 

102551 ACAAATTTTCAAATACCACTTATGAGAAGCACTAAGATCACCTTTTTCTT 
102 601 TATGACTTTCTTTCTAAAGCTAAGCTGGTAGTCATGACTCATGATTATCC 
102 651 TTTTCCTAATGGGAATATTGTGGAAGCGGTTTCAAATCTTTAGACAAAAT 
1027 01 TCCATGGCCACTAAAAGTTAGCAAAGTTAAAATAAGTTTAAAAAAATATG 
1027 51 AGTGTACTTGGCCATATGCCATATTGTTGAGATCATAACAAGAGAAATAA 
102 8 01 TAGTTTATTGAAGTTTAGATCATAATCACAATACATCATTGCCTTCATCA 
102851 ACATTTTCCATGGATTTGAGAGGATCAACTTCAATACTAATGGTGGGGTC 
102 901 TTATTCATCCATTGCTCTCTAGCCAATTAAGCAGTTAGGTTATTTGTGTA 

102 951 CTCTAGTAGTTGCCAAATCAATCTTAATATTCACAATGTTGTAATTTCTA 
103001 ATTACGT ATAGATAAATGACTAGATAACACGTGGCTTTGGTTTTATCAGG 
103051 AAAGTTTTCCAAATCATATATATGAATGTAGAATAGTGTTCTTCATTAAT 
103101 TATTAATTAGCATCTCACCATCTGAGACTGGGAGCATGTGACAAGTTGAC 
103151 ATGTGTATTAAGAGAACTTTGAGAAAACCACTTTTATGATACTCCCATCT 
103201 GAGACTGGGATGAGTACCATTTTATAAAAATATGAGTAGTGAAAAAATAT 
103251 TCAAAAAAAATTCTAACATGTCCTTTAAAACATTTTAACCTTATAATTTT 
103301 AACAAACATCTTCCAATATGCGTTATGAAAACTTTATAAAACTTTTTTAT 
103351 AACATGCTTTTGAAAATTTTATAAATCTGTATTTTTAGAAACAAAGTGAT 
1034 01 ACTTTTGAAAATAGACAAATGAAGTGCTATTTTTTAAAATTGATATCATA 
1034 51 AGTCTTAACTGTGGTTTGTTTGAATTTTATTTATATACTTGTCAAAATAA 
103501 AACTAAATAAATAAATTAAATTATTTTATAATCATGAAGATAATATTATC 
103551 ATAAAAGATAAATATAAAATCAACAAATTTATATTTGTTAATAAAAATAC 

103 601 TTTGAGCTCTTCTTCATAAGACTTTTCCAGCTTCCATCTAGAAAATCACA 
103 651 TAAATTAAAAGATAAATAACCGAATAAACATAGTTCACATTCTAACTCTT 
1037 01 AGTCTTAGATTTGTTTTAATTTTCAAAGGTTTAGGTATTGTATATGTTTT 
1037 51 TTTTATTGGGTTGCTAGATTTTGATCCAAGAAGAAATGACGGGTTGTAGT 
103801 ATAGATGGTTTGTTTGAGTTTTTTCCCCTTGGTTTACTTCGTTTGGTTTT 
103851 TGTCCCCAGAATTGTTCTTGTACTCGCTGGTTTATGTCTCTACAAAGTCC 
103 901 ACGACCATTGCCGGCTCTTTGTATTTCAACTTGAATTCTAAATTCGATTG 

103 951 ATGAAAAAAAAATGTATCTCTTAAAGTCCATTAGTACCAAAAATAACTAT 

104 001 ATCATTACTACATAAAATAGTCTTGGGTTTTCCAAAGTATTTCGTTGATA 
104 0 51 TATGTTAAGAGTTCGAAATAGACACATAGATATAATGTTGAAATG 

10 4101 TCTCACATAATTATCTCCTTTTCTCTTCATTTCTCTACCTCTCAAGTTTC 
104151 CAATCCCACCCTAAGGTAATTTATTTCTTAACCTAAGTAAATTTGTTAAC 
104 2 01 AAATCTTAACTAGCTACAAATGTGTATTACAAGTCTTAAATAAAAACCTA 
104 2 51 CTTTAATTCAAAGGTATTAAACCTTCCTAAATTGATACTTACTTAGTATC 
104 3 01 GATCGGTCTAGTTTAGGGTTTGGACAACACACCATCATGGGGACGAAATT 
104 351 AGTCATTCTACGGTGTCCAAGACACAAATCTCGGACTCGATGTGGATATG 
104 4 01 ACACTTCATTATAACTTTTAACTTCATAAAAACTAACTATTAGGAGGAAG 
104 4 51 AATCGGAATCTGCATATCAATCACAATAGACTATAGTATACTTAGATTTT 
104 501 GATCTAATCAATGGGCTCCTTCAACTAATAAGTAGCCCACTACCAATAAT 
104 551 GAAATCATAAGACATTATTAAATTAATCAATGTTCTAAAAATACTTTGGT 
104 601 TATGTGTCCCGT AGAGCTAATGTGCACACACAATGAAAGTTGACCCGTTT 
104 651 CACTTGTCCCACTTTTATGATCTTTTCTTTTAGGTTAAATCCAACTTTTA 
104 7 01 TAATCTCATCTTGTTATCAAACAAAACTTTTGGCCTGTCTTTTTCATAAT 
104 751 TTAAAGTAACTCTCACGGAGAAAAGCCAACATTTTCTTCTTGTTTTATTC 
104 801 TTTTTAAGAAAAATGAATTCAAGGGGACCCCAAATTTAAAAGGAAAACCA 
104 851 AAACTCCTTTCTATGTATTTATTACTTGAAGTTTTCTATGTAATCAACAA 
104 901 TCCTAACAGTAGAGAATAAAAAACATCGTTTTGGGAGGTTTTATATTAGC 



FIG. 3 (Continued) 

105001 CCTCTGTCAATGGAGCTATATCACTTGTCATTTTGCTTAACCCTTTGCGG 
105051 GAAGATTGTTATGAAACAGTTTTAATGGAATTCTAGTTGCCAATGTCACG 
105101 TTTAATATGTTTTGTCCCTATACTTTATTGAATCTTATAATCTTTGTTAT 
105151 AGAATTATCTACTTTTAGTATTTTACATTAACATAATCTATAGAATTCTT 
105201 CTTTGTTCTATACAATTAAACAAGTAATATATTCTTAATACATATTAAAA 
105251 ATGGTGGTGTTGCTATCTGAGCTGTAATAGTTGATTGCTCCAGAGAAGAA 
105301 TAGACAAAAATCCTTACTTAAGAGGCCC ACCACTCTGAAAATTT AGACAA 
105351 GAAAAATTAAACAAAATTAGGTTACACATATTATCATTTATATATATGCA 
1054 01 CAACACAAAGTTGACCTTGCAATGTACTATTGAATAAAATAAATAAATGC 

1054 51 AAGAAGAGAGGGAATTATCACTGTTACCAAGAAAACAACTTCCTCTAAAC 
105501 AGGTCTCTATATATATAAACTTTAACACCTAAAGAATTAACACAGATCAA 
105 551 GAAAAAATCCTCAAAACAAAAGTTAAAGCAGAC ATG AAG CAA CAG CAA 

1 M K Q Q Q 

1055 9 9 CGT TAC TTG GTC GTC TTC ATC GTC CTT TTA AGC TTT CTT 

6RYLVVF I VLL S FL 
105638 CTG GTAAAGCTTCTTCCTTAATTATATTAAAACCCTAATTAAGATCTCATATA 

19 L 

105691 TCTGAATGTTGTATATATTTGTTGGTATAG TTT GTG AAT CTG AGT 

20 F V N L S 
1057 3 6 GAA GGA AGA ACA GGA GGA GTT GCA GAA GAA TAT TGG AAG 

25 EGRTGGVAEEYWK 

1057 7 5 AAG ATG ATG AAG AAT GAA CCG TTG CCT GAA CCA ATC AAA 
38KMMKNE PLPE P I K 

105814 GAG CTT CTC AAC AAT CCT TTT AGG ACC GCA CAA GAG AGA 
51 E LLNN P FRTAQER 

105853 TTC ATC CAG AAT TTC GAC ACC AAA TCT GTT GTC ATC ATC 
64FIQNFDTKSVVI I 

1058 92 TAC CAC AAT CCT AAT GAA TAA TCAATGAAGTCTCTCATATAG 
77 Y H N P N E 

105 934 ATATCTATGACTTTAATTTGTGTTTATGTATGGATCGACTTATACGTGCA 
105 98 4 CGTATATGTTATTAATTAAGAAAAGAAAAAGCTGCTTGAGTTGTTGTGTT 
10 60 34 ATACACGTATACTAAATATGTTCTGTTTAGTGCAGAAATGTTAACCCTAG 
1060 94 CTATAAGGGATTTTTTGTTCTTTTTTTTTTGTTACCATTAATGTGAGTGA 
10 614 4 GTGAGTTTTGTGTGATGAAAATTAGATTTGCTTCACATTTTGTTTTGATA 
106194 TATATAAATCAATATACTGTGCCTTTCGTGTCTTGTTTCTTATATTATTT 
10 6244 TGTGACATTAATTAATTATCTTATCAAAAATTTATTTTATTAACTGTGTC 
10 6294 CTATGGAAAAAGATGAACAATATGAGTTAACCTCATCTCAAGGAGATTCT 
106344 TTTTTGTTTTGTTTTTC 



FIG. 4 

1 AAGCTTTACAAATGTCCCCCAAAGATGAAACCACGTTATTATTAGTAAATCCTGAAAAGG 
61 TTAACGCTTCTGTTCCTCGAATTCTAAACCATCTGAAATATCTAGTGGTTTAAAATGGAG 
121 ACTTGAGGATATAGTCTCCTGAACCAGCTGTCACGGCTGAGTTAGATAACATTACTGAAT 
181 TTCTACGGGAGCGGTTGAAATCACTTTCGCCCCTTTAAG7VAGAAGCCTACACCGGGCACC 
2 41 TTCTTTACGCAATTCGAAATTTAGTCTTGCCAGGCAGTCGTTGGATCGAAGGTCTTTTTC 
301 GATACCGAGGAATCTGACTTTGCAAGGAATAATTCCTAATCACACCACCCCAACCCCTGA 
361 ATACACTTCAGGACCCTCTG7VAACCAACTTCGTTTCGGCTAAATCACAAGAATCTCCCAC 
421 TCATTCCGATTTTAGCCAATTAAATATGATATCGGTCTGGGAAGCCGATAAGGAAATTCT 
4 81 ACAAAAAGAGTTTATGAATGAGGAAAATAAGGAAAAGAGAGAACTATTTTTTAGGTACCC 
541 TGAAAGAGAACGAGAAAAATTTAG7UVAAAAATACTACTCTCATCTGTACACTGTTCAAAA 
601 GAATATCCnnnnnAATGGTTAGATAATATAAGAAAAGGATAAGTATGATTAAACTGAAAC 
661 CACGTCGGCAGAAACAAAGTGAATTCCCCCCTTTAGAGGAAGTTCGTTTCTTAAATATAG 
721 AAAACAAAGAAGTAGTCGCCTCCCCTTTTAAAATGATCTCAGAAAAACGAGAAGTAAGTA 
7 81 TAAAAGATATTCAAAATCTACACAGTCAACTAAATTTTACTAATCAAATGCTTTTTCAAT 
841 TAGCAAATAAAAAACAAAAGAAAAAAGmGAAAATTGAAGAAAAATCGTTAATAAAACCAT 
901 TTAAATTCTCAGAAGAAGAGATAAAACAGTTAAAAATTGGTCAAACTTTGGATTCTTTAT 
961 ACGATGAAGTAAAACAAAAGTTATCTATCTCGGTAATAAAAGAAAAACCGAAATCTAATA 
1021 ATGATATGCCCAAAAGGACAAATCCAAATC7VAGAAGTTTTAGACGAAATCGAAAAGAGAT 
1081 TAAAACAAACTCTGAACGACACAATAAATGTGATAGAAGAAACTAAAAACTCAGACTCAT 
1141 GTTCAGAGTCTCCCGATCGTATTGAAAAAATAAAACGTAATAAATCAGAGATTTCCAGTA 
1201 AGCCGAAATTTTTACACTCGCCCCACCTTCGATATCATCGAGATGGCGATGGACACCTCA 
12 61 GCATTGATGGAATGGATACTGAGTGATATGATGGATGACAGATGATGAATATAGAAAAAC 
1321 TCACGAAATAACAATGGCCGCTACAGCATATAGAGTAAAACATACCGAGGAACAAACAAT 
1381 AAAATTAATTATATCTGGATTCACGGGAGTATTAAAAGGCTGGTGGGATAATTACCTCAT 

14 41 GCCAGAACAAAAGAATTATGTTCTAAGCTGTGTAAAAATAGAAAACGAAGAAGGAATACC 
1501 ACTAATGGTGGAAACATTGGTGGTAGCAATAATTCATAACTTTATAGGAGATCCAAAGAT 

15 61 TTTTGAAGAAAGAACATCTTTATTACTTCATAATCTAAGATGTCCAACCTTAGGTGACTT 
1621 TAGATGGTATTCAGAAAATTTTTTAGCTATGGTTTTAACAAGGGAAGATTGTAGAGAACC 
1681 TTTCTGGAAAGAACGGTTTATAGCTGGATTACCGGATATCTTTGCTGAAAAGGTAAAAGA 

17 41 AAATTTACAAAAGGAATGCCCAAACACCCAATTAAAAGATGTACCATACGGGAAAATAAG 
1801 TTCAGTTGTAAAAAATACAGGTCTTCAGTTATGCAATAATATGAAAATAGAAAATAAGAT 

18 61 AAAAAAGAGTGAGAGTCAGGGCATCAAGGAATTAGGGGAATTTTGTACTCAATACGGTTA 
1921 TGAACGAAATACCCCTCCATCAAAAAATAAAAAGAAAATAGCAAAAAGAAGAACAgGGAG 
1981 AAACAAGCGCTAAAACAAGCGCTAAACCAGCACGTAAAAATTTTAGAAAAACGGTTAATT 
2 041 TTAGAAAACCATGAAAGTCTAATGATAAGCCCACTATAGTCTGTTATAAATGTGGACGCA 
2101 TAGGACACATGAAGCGAGACTGTAGACTAAAAGAAAAAATTAGTAATTTGACCATAAGTG 
2161 ATGAATTAAAAGAACAAATGGAAAAACTTCTGATAAATTCCTCCAGAAGAGGAAGAAACA 
2221 GAAGAATCAATAGGAGATTCTGATTACGAAGTATTGGACATGAGGATAACAATTGTAATT 
2281 GTGTCTATAAAATAAATACGATAAGTAGTGAATTAAAATTTGCGTTAGATTGCATTGATA 
2341 AAATTAATAATCCGGAGGAAAAGACCAAAGCCTTAATAGACATGAAAAGGCTACTCGTTG 
24 01 AAAAAGATGAACCCAGTTCATCTTCACAAAAACCTGAATTTATAGGATATGATTTTAAAG 
24 61 AAATATTGAGAAAAGCGAAAACATCACATAAAGAAATAACCATTAGCGATCTTAATAGTG 
2521 AAATAAATAAATTAAAAGCCGAAATCGAATCTATAAAAGTCGAGCTACAAGAATTAAAAG 
2581 ATAAAATTATACATGAGGAATCCATCTCCTCTGCCGACGAAAATTCACAAGAAGAGGAAG 
2 641 CTAGTAGACCTTCCATCAAAGAAATAACATACAAAAGACAAAAGTGGCATGTAAAAATAG 
27 01 CCCTAGAATTTGTTTGTTTTGTGACCGTTTCATTGTGGTCAAAGATGAGTCCTTACCTAA 
27 61 CACAATAAAAAACGTTACTCTTAAATATCAAAGGAGAGCTACAAATATCAATGAATGAAT 
2821 GACATTAATATTTTTCTTTAGTTTTAAAACTTGAATGAGTTGTTTTCATAAATATCTGAC 



FIG. 4 (Continued) 

2 941 ATATTTTTATCTCAAAGTCAACGAAATATTATAAAAGAATCAATTAAAAAAAATTATTCT 
3001 TTTGCAGAAAAAAAAATTAAAAATATGAAACTCCTCCACACCATATTACCATATTATAAA 
30 61 TATAAAAAAACCTCTCACAAATGTGCATTCTGGAATTCTTTATGTTGAGAGATTAATCTC 
3121 TAAAGAAAAAAGGTTGAGAAAGGTGCAGCAACA ATG TCT CCA TTC TGT AGA 

1 M S P F C R 

3172 AAC TTT TCA ATG GCA TGG GTG CTT ATG GCA TTT GTG TTG TTT 

7N FS MAWVLMAFVL F 
3214 GCA AAC AGT GCT ATG CCC ACA AAT GGA TCC ACT GTT GGG GTA 
21 A NSAMPTNGSTVGV 

325 6 AAA AAC ATG TTG GGT GGT AAA TTG ATG CTA AAC GTT TTA TGT 
35KNMLGGKLMLNVLC 

32 98 CCC CAT ATT GAT AAG CAA CAC ATT ATC CCG AAT GGT GGT TCA 
49 p R iDKQH I I PNGG S 

334 0 TTT GAG TGG AAG TAC AAT GGT GGT GCT CCA CCA ATA GGA CAA 

63FEWKYNGGAPPIGQ 
3382 TCA CCA TTC ATG TGT TTC TTT CGG TGG AAT AAT GTT CAT CAC 

77 S PFMCFFRWNNVHH 
342 4 TCC CTT GAT CTG TGT TCA CCA AGC AAG TAT ACT GGT TGT GAA 

91SLDLCSPSKYTGCE 
34 66 AAT GCC ATT TGG GAA ATC AAA GAA AAG CAA TTT TGT AGG TAC 

105 NAIWEIKEKQFCRY 
350 8 AGA GGT GGA CCT ATT AAT TAT TTT TGC TAT GAC TGG GAT GAT 

119 RGGPINYFCYDWDD 
3550 TAG TTATATAGATTATTCATGTTTCATCTCAATAAAAAAATGACTTTAGAGTGATTCTT 

3 60 9 AGTTTGCTTAACATTCTTACATATTCCTAACTATTCCGTCACTACCACCCGTAACTATAT 
3 669 TTATTTAAAATTAGTATCTGTCACAGTTTTATTTTTAAAAAAGGTTATGTGGATTAGAAG 
372 9 AGAGATAAATATGTAGACGGTCACCAACCTTAATTTTTGAACTATGTAAGACTATATTGA 
37 8 9 CCAAGAATATATGTTTAAACTCATTCATTTAAAGACTATATCTCCATTTATGATTATGCA 
384 9 AATGCAATTAGTTTTTTTTTTCATTGAAGAATTCAAAAGAAAGTTATCATTAAAAAGTAT 
3 90 9 CATTAAATCACTTATATGTTGTTTCTTAATATCCTTATTGTTAATAGAATAATTTTTTTT 

3 969 ATCCTTTAATTAAGGTTATTACTACTTTTTTTTCATATCTTCATTATTTTGAAATATTTT 

4 02 9 TAAAATTTATCAATTTTTGTAACACCCCAGAAAATACATGTAACTATCACTTTTTTTTTA 
4 08 9 TATTACAAATTTATGACTTATAGAAATACAAATATTAAAAATATAAGGTTCAAAACTACA 
414 9 TCCTAAAGTCTTTCAGACCCTCTGACACATGTATCATCTGCTCGTATATGTGATACAGTC 
420 9 ATCGCAGTTCACAAGATAACAAGAAAACCAAGGGTAAGCTAATGAAAAAAAATTCCATAA 
42 69 CATATTTAATTCATGCAAAAAGAACCAGTCAAAGTAATCATTTATAAACATTTCTTTAAA 
4 32 9 TATTGTTATATAAAATTTCAATATCAATTTCATCATTCATATAGACCACACATGGATCTA 
4 38 9 TTTTC AATCACAATCATTGGATTTCATTTTAATCCTACTTCGnCTTCCAGAAGACTCATT 
4 4 4 9 AAGTATGCCCCTACCAGAGACTAACACCTAATCAAAGAGAAATGATCAAGGTAAGTTCAA 
4 50 9 ACATCCAATAACGAGTGCCTACAGTGGGACCCAATGTGTATGAACTCCTTATCAGCTTCT 
4 5 69 CACCACCTGATATCTTATTCTATATGACGTAGATCATCAGTGAAACTAGAGGATCTCCGT 
4 62 9 TAAACATATGTTTTTTATACTTAATGTCATCAAACAACAACTCACACATTATCCCAAATG 
4 68 9 TATGACATCAATTTCATACAATTTTCATCATTCATATATAATACATATCATTGAATCACA 
4 7 4 9 TAACATTTAAAAATTCATACCATTCAAGAACTTTTCCAACATCAAAAGCAATATTTACTT 
4 80 9 TCAAACTATCAAAATATAATTATTATTTAATAAAGCT t 



FIG. 5 

142 00 0 TTATCTTATTTCCATATAATTGTTGTTTTACTTTCAAAATTTTTAATTTT 

141950 TTATATTTATCTTTTTACAGTTTAAAATTAATAAAATGAAACTTTTTTTC 

14190 0 TTAAATGTGTTAAAATATAAAATCAAAAAAGTTGTTATATGGTACATGGC 

14185 0 ACAATCTTATAAATTATTAATTTGAAAACGATACTTTATATAATAAAATT 

141800 ATCTTAGTTGACATTTTTATTAGTGTTTTCAATCATATTTTTGTTTGCTT 

141750 GATAAGCGTAAAACAAATCAAACTTAACGATACTTTATATAATAAAATTA 

1417 00 TCTTAGTTGACATTTTTATTAGTGTCTTCAATCATATCTTTGTTTGCTTG 

141650 ATAAGCGTAAAACAAATCAAGTAAAGTTGGGCACCTCAATTGTTTTAAAA 

141600 - AAGTTTGGGTACCTCAAAAATTAATAGGTCTTGTCAGATTCTTACAAAAA 

14155 0 AAATCTGGAAGAATTTATGAAAGAAGGGGGGGGAGGGGGGGAGGGGGGGG 

141500 AAGTGAAGATGAATATTC AACAAAAGAGGGTAGGCATGATGTTAAGTGAG 

14145 0 TTAAAAAACTATGTTAATGGAGACAATTTTCTGTTAACAAACCCGTTAAT 

1414 00 TGAAAACGATAGCATTCTTCTCTAACAATGTAAAACGATATTGTTTTATC 

141350 ATAACTACTCATTAAATTTCTGAGTTTCAAATCATATAAAGATTTAGGGG 

141300 GGTGTATTCAATTAAGGATTTGAAATGATTTGTATTAAAATGACAAATCC 

14125 0 CATGTTATTTCAAACATGAATTGTAAAAACTTTTTTAAAATCAAGTGTTA 

1412 0 0 TTAGATTAGTGATTTTAAAATGTACAACCAAACCCACTGTTATTGGAAAC 

141150 ATTTTAAGTAGTGGATTTAAAATGACTTGAGTGATTTTGGGTGGGATTGC 

14110 0 AGAAAATTTCTTAGTTAAGAATTCAAACATCCAAATCTCATGGTTTCAAG 

14105 0 TAGAATTTGGGAGAATTTTAATAACAAATCTCCTAATTTACCAAAAGTCA 

14100 0 CCAAAATCATTTAAAAACTCATTAAAATTTAAATGATTTCAAATCTCCAG 

14 0 950 TTGAATACATCCCCTTGGAATTAGAGATTTTGCTCGATTTGGGACCTAAG 

14 0 900 ATTGAATTTTGGGGATTT AGTTT AATCGTTACAACAAAATGACATCGTAT 

14 085 0 TATTGTTATAGGAAACAATGTCGTTTTCAGTTGACATGTATGTTAATAGA 

14 0800 AAATTAACTCTATTAACGGGATTTGCTAACCCATTTAACATCGTAACTAA 

14 07 50 ATGGTCAAGTCAATAAAAGTTTGGTATTTATTTGAAAAGTCAACGTAAGT 

14 07 0 0 TTGATATTTATTTGAAAAGTCAACATAAATTTGATATCTTATTTCGTTTC 

14 0 650 GACAGACATAAGGATTTACATCAATGTTTTTAATAAATTAAAGATTATTA 

14 0 600 TGACATTTTTTCCATTTAAAATTGCCAATGTTTTCGAAAGCAAGATACTC 

14 055 0 AAAATTGACATACCTAATTCAATCTACATTTGTTTGACAGCAATTCACGT 

14 0500 GCCTTGACCACATGGCACATACTGGCAATACATCAATTTTAAGGAAAAGG 

14 0450 TAGATTCGGATACAATATAATGGAAATAAGTGGAAAGGATCATTGACTAC 

14 04 00 TTGACTTGTAACAAACAACACACAGTATATAACTCATTCGACATTTACAA 

14 0350 ACAACATTGTGCTAGCTTAAACTCCCTCTCCTATTCAAAAAA ATG 
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13 97 93 GACTCATGGAGTTTCATTGTTTGAAACAATAATATAAAACATATAAAATT 
13 97 4 3 TCTATTATTCCAAGTTCCAACTTATAATAATTTGATAATCATATCATATT 
13 9 693 ATCATCTTAAGCATTCAATGCTACAAAGATAATACCCCCAAGCTATTTTA 



13 964 3 CATTAAAAGCTGAAACAGAGACACAATACTAACGATAAAAGTTCGT 

13 95 93 TCTTTATGCAACCATACATACATATACACAAAGATAGACAGGTAGTGTCC 

13 954 3 TAATAATTCTACTTGGGTGAGGTATGAACAGCAGCAACAGTAGATAC^? A 

13 94 93 TGTATCC AT ACC ACACATATTATGAGGCCCTCTGCAGATTTTGTAGTAAC 

13 9443 CATGCTCTCCCCACATCGCTCCCCACGAGTTCTTGATAATCCAA 
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FIG. 7 

G564 promoter: 
Gain of function constructs 
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Web Signal Scan Program 
Database searched: PLACE 

URL : http : //www. dna . af f rc .go . jp/btdocs/ PLACE/ 
This is the sequence you submitted 

>G564 promoter (-921 to -662), 450 bases, 3D1AOBF4 checksum. 

TGAAAAGTGAAGAAAAC C ATGTAATGAAAACAAAATGG C ACG ACAATC AA 

AAAAAGTTTTCACGCAAAATTTTCTTCAAAATTTATAACATTTTCATGTT 

GTGTTTGTTTCAAAGCCTAGAAAAACGAAGAGTTACTATTGGTAATGAAA 

AGCGAAGAAAACCACATAATAAAAACAAAATGGCACGACAATCAAGAAAA 

AGTTTTCACACAAAACTTTTTTCAAAATTTACTATGTTTATTTCGAAATT 

TAGAAAAACGAAGAGTTATTATTAGTAATGAAAAGCGAAGAAAACTACGT 

AATAAAAAACAAAATGGCACGACAATAAAAAAAGTTTTCACGCAAAATTT 

TCTTGGTGCGCAGAAAGTTATATATATTAATTAATTAATTTTCATTTACT 

TTTTTCCCTTTTTATTTTAAAGTTAAATTATTATTATTTTCATTTAAAAT 

Notation: H = A, C, or T 
R=A or G 
K = G or T 
W=A or T 



RESULTS OF YOUR SIGNAL SCAN SEARCH REQUEST 



/ tmp/ signal seqdone .9437 


: 450 base 


pairs 






Signal Database File: 
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For more information about the SignalScan Program, please contact Dr Dan S. 
Prestridge Tele: (612) 625-3744 Advanced Biosciences Computing Center. E- 
mail :danp@biosci .umn. edu 14 79 

Gortner Ave. University of Minnesota St. Paul, MN 55108 

The TFD data is at the gopher site, gopher://genome-gopher.stanford.edu. For 
more information about the WebSignalScan service, please contact Meena 
Sakharkar, meena@biomed . nus . sg, Biolnf ormatics centre, NUS . 
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Database Searched: PlantCARE 

URL : http : //sphinx. rug . ac . be : 808 0 /Plant CARE/ 
Sequence submitted: 

>G564 promoter (-921 to -662) 11/21/00 

+ GAAAAGTGAA GAAAACCATG TAATGAAAAC AAAATGGCAC GACAATCAAA AAAAGTTTTC ACGCAAAATT 
+ TTCTTCAAAA TTTATAACAT TTTCATGTTG TGTTTGTTTC AAAG C C TAG A AAAACGAAGA GTTACTATTG 
+ GTAATGAAAA GCGAAGAAAA CCACATAATA AAAACAAAAT GGCACGACAA TCAAGAAAAA GTTTTCACAC 
+ AAAACTTTTT TCAAAATTTA CTATGTTTAT TTCGAAATTT AGAAAAACGA AGAGTTATTA TTAGTAATGA 
+ AAAGCGAAGA AAACTACGTA ATAAAAAACA AAATGGCACG ACAATAAAAA AAGTTTTCAC GCAAAATTTT 
+ CTTGGTGCGC AGAAAGTTAT ATATATTAAT TAATTAATTT TCATTTACTT TTTTCCCTTT TTATTTTAAA 
+ GTTAAATTAT TATTATTTTC ATTTAAAA 

- CTTTTCACTT CTTTTGGTAC ATTACTTTTG TTTTACCGTG CTGTTAGTTT TTTTGAAAAG TGCGTTTTAA 

- AAGAAGTTTT AAATATTGTA AAAGTACAAC ACAAACAAAG TTTCGGATCT TTTTGCTTCT CAATGATAAC 

- CATTACTTTT CGCTTCTTTT GGTGTATTAT TTTTGTTTTA CCGTGCTGTT AGTTCTTTTT CAAAAGTGTG 

- TTTTGAAAAA AGTTTTAAAT GATACAAATA AAGCTTTAAA TCTTTTTGCT TCTCAATAAT AATCATTACT 

- TTTCGCTTCT TTTGATGCAT TATTTTTTGT TTTACCGTGC TGTTATTTTT TTCAAAAGTG CGTTTTAAAA 

- GAACCACGCG TCTTTCAATA TATATAATTA ATTAATTAAA AGTAAATGAA AAAAGGGAAA AATAAAATTT 

- CAATTTAATA ATAATAAAAG TAAATTTT 

3 -AFl_binding_sit 

Site Name 

3 -AFl_binding_si 
Function : 

AAGAA - mot i f 

Site Name 

AAGAA- mot i f Avena 
AAGAA - mo t i f Avena 
AAGAA - mo t i f Avena 

Function : 

ABRE 

Figure 8 (cont'd) 



Organism Position Strand 



ST 



260 



Core 
simil . 
1 . 000 



Matrix 

simil 

0.860 



sequence 



AAGAgttatt 



Organism Position Strand 



sativa 
sativa 
sativa 



6 
151 
284 



Core 
simil , 
1 . 000 
1. 000 
1 . 000 



Matrix 
simil 
0 . 903 
0 . 870 
0 . 870 



sequence 

gtgAAGAa 
gcgAAGAa 
gcgAAGAa 
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Site Name 


Organism 


Position 


Strand 


Core 


Matrix sequence 










simil . 


simil 




ABRE 


Hordeum 


293 


+ 


1.000 


0.854 actACGTaat 




vulgare 


Function: cis-acting 


element 


involved in 


the abscisi 


acid responsiveness 












ACE 














Site Name 


Organism 


Position 


Strand 


Core 


Matrix 


sequence 










simil 


simil 




ACE 


Petroselinum 


293 


+ 


1 .000 


0 . 908 


actACGTaat 




crispum 












Function 


: cis-acting 


element involved in 


light responsivenes 


s 


AE-box 












Site Name 


Organism 


Position 


Strand 


Core 


Ma t r ix 


sequence 










simil 


s imi 1 




AE-box 


Arabidopsis 


67 




1 . 000 


0 . 852 


AGAAaatt 




thaliana 












AE-box 


Arabidopsis 


345 




1.000 


0 . 852 


AGAAaatt 




thaliana 












AE-box 


Arabidopsis 


361 


+ 


1. 000 


0 . 852 


AGAAagtt 




thaliana 












Function: part of 


a module for 


light response 






ATI-motif 












Site Name 


Organism 


Position 


Strand 


Core 


Ma 1 Y~ ~i -y 

rid L, -L J.j\. 


sequence 










simil 


simil 




ATI -motif 


Solanum 


409 


+ 


1 . 000 


0 . 859 


ttttATTTaaa 




tuberosum 












Function : 


part of a light responsive module 








Box_4 














Site Name 


Organism 


Position 


Strand 


Core 


Matrix 


sequence 










simil 


simil 




Box_4 


PC 


375 


+ 


1. 000 


1 . 000 


ATTAat 


Box_4 


PC 


379 


+ 


1. 000 


1. 000 


ATTAat 


Box_4 


PC 


383 




1 . 000 


1. 000 


ATTAat 


Function : 












Box_I 














Site Name 


Organism 


Position 


Strand 


Core 


Matrix 


sequence 










simil 


simil 




Box_I 


PS 


107 




1 . 000 


1. 000 


TTTCaaa 


Box__I 


PS 


203 


+ 


1. 000 


0 . 857 


TTTCaca 


Box_I 


PS 


219 


+ 


1 . 000 


1 . 000 


TTTCaaa 


Box_I 


PS 


240 


+ 


1 . 000 


0 . 857 


TTTCgaa 


Box_I 


PS 


241 




1. 000 


0 . 857 


TTTCgaa 


Box_I 


PS 


249 




1 . 000 


0 . 857 


TTTCtaa 



Figure 8 (cont'd) 
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Function : 



BOX_II 

Site Name Organism Position Strand Core Matrix sequence 

simil. simil 

Box_II ST 139 + 1.000 0.889 TGGTaatga 

Box_II AT 161 + 1.000 0.954 CCACat_aat 

Function : 



CAAT-box 



Site Name Organism Position Strand Core Matrix sequenc 

simil. simil 

CAAT-box Hordeum vulgare 43 + 1.000 1.000 CAAT 



CAAT-box 

CAAT-box 
CAAT-box 
CAAT-box 

Function 

ERE 

Site Name 
ERE 



Arabidopsis 137 - 1.000 1.000 aCCAAt 
thaliana 

Hordeum vulgare 188 + 1.000 1.000 CAAT 

Hordeum vulgare 322 + 1.000 1.000 CAAT 

Arabidopsis 351 - 1.000 0.857 aCCAAg 
thaliana 

common cis -acting element in promoter and enhancer regions 



Organism 

Dianthus 
caryophyllus 



Position 
239 



Strand 



Core 
simil . 
1.000 



Matrix 
simil 
0 . 875 



sequence 
ATTTcgaa 



ERE 



Dianthus 
caryophyllus 



241 



1.000 0.875 ATTTcgaa 



ERE 



Dianthus 
caryophyllus 



413 



1.000 0.875 ATTTtaaa 



ERE 



Dianthus 
caryophyllus 



441 



1.000 0.875 ATTTaaaa 



ERE Dianthus 442 - 1.000 0.875 ATTTtaaa 

caryophyl lus 
Function: ethylene -responsive element 



G-box 



Site Name 


Organism 


Position 


Strand 


Core 


Matrix 


sequence 










simil . 


simil 




G-box 


Zea mays 


17 


+ 


0 . 842 


0 . 870 


CATGta 


G-box 


Zea mays 


38 


+ 


1. 000 


0 . 903 


CACGac 


G-box 


Zea mays 


94 


+ 


0 . 842 


0 . 886 


CATGtt 


G-box 


Zea mays 


183 


+ 


1 . 000 


0 . 903 


CACGac 


G-box 


Zea mays 


317 


+ 


1 . 000 


0 . 903 


CACGac 


Function : 


cis-acting 


regulatory 


element 


involved in 


light 





responsiveness 



GC-repeat 



Figure 8 (cont'd) 
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Site Name 



GC-repeatOryza sativa 
Function: ? 



Organism Position Strand 
351 



Core Matrix sequence 

simil . simil 

1.000 1.000 gCACCaag 



HSE 

Site Name Organism 



HSE 



Brassica 
oleracea 



Position Strand 



49 



Core Matrix sequence 

simil. simil 

0.944 0.878 aAAAAagtt 



HSE 



Brassica 
oleracea 



50 



0.944 0.912 aAAAAgttt 



HSE 



Brassica 
oleracea 



52 



0.944 0.874 gAAAActtt 



HSE 



Brassica 
oleracea 



66 



1 . 000 



0.97 8 aGAAAattt 



HSE 



Brassica 
oleracea 



77 



0 .833 



0.868 aTAAAtttt 



HSE 



Brassica 
oleracea 



87 



1.000 0.853 tGAAAatgt 



HSE 



Brassica 
oleracea 



196 



0.944 0.912 aAAAAgttt 



HSE 



Brassica 
oleracea 



198 



0.944 0.874 gAAAActtt 



HSE 



Brassica 
oleracea 



210 



0 . 944 



0.874 cAAAActtt 



HSE 



HSE 



HSE 



Brassica 
oleracea 

Brassica 
oleracea 

Brassica 
oleracea 



212 



213 



327 



0.944 0.912 aAAAAgttt 



0.944 0.878 aAAAAagtt 



0.944 0.878 aAAAAagtt 



HSE 



Brassica 
oleracea 



328 



0.944 0.912 aAAAAgttt 



HSE 



Brassica 
oleracea 



330 



0.944 0.874 gAAAActtt 



HSE 



HSE 



Brassica 
oleracea 

Brassica 
oleracea 



344 



361 



1 . 000 



0.97 8 aGAAAattt 



1.000 0.888 aGAAAgtta 



Figure 8 (cont'd) 
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HSE Brassica 385 - 1.000 0.853 tGAAAatta 

oleracea 

Function: cis-acting element involved in heat stress responsiveness 
I -box 



Site Name 


Organism Position 


Strand 


Core 


Matrix 


sequence 










simil . 


simil 




I -box 


Pi sum sativum 


93 


- 


0 . 857 


0 . 883 


aACATga 


I -box 


Pi sum sativum 


162 


+ 


0 . 857 


0 . 883 


cACATaa 


I - box 


Solanum tuberosum 


163 


- 


1 . 000 


1 . 000 


tATTAtgt 


I -box 


Pisum sativum 


237 


- 


0 . 857 


0 . 941 


gAAATaa 


I -box 


Pi sum sativum 


367 




1 . 000 


1 . 000 


tATATaa 


I - box 


Pisum sativum 


372 


+ 


1.000 


0.941 


tATATta 


I - box 


Pisum sativum 


391 




0 . 857 


0 . 941 


tAAATga 


I - box 


Pisum sativum 


411 




0.857 


0.883 


aAAATaa 


I -box 


Pisum sativum 


423 


+ 


0 . 857 


0 . 883 


tAAATta 


I -box 


Solanum tuberosum 


424 




1 . 000 


0 . 903 


aATAAttt 


I - box 


Arabidopsis 


426 




1 . 000 


0 . 863 


aATAAtaat 




thaliana 












I -box 


Arabidopsis 


429 




1.000 


0.863 


aATAAtaat 




thaliana 












I -box 


Solanum tuberosum 


431 


+. 


1. 000 


0 . 951 


tATTAttt 


I -box 


Pisum sativum 


433 




0. 857 


0. 883 


aAAATaa 


I -box 


Pisum sativum 


439 




0 . 857 


0 . 941 


tAAATga 


Function : 


part of a light 


responsive element 








P-box 














Site Name 


Organism Position 


Strand 


Core 


Matrix 


sequence 










simil . 


simil 




P-box 


Oryza sativa 


406 


+ 


1.000 


0 . 857 


CCTTttt 


Function : 


gibberellin-responsive element 








Prolamin_box 












Site Name 


Organism 


Position 


Strand 


Core 


Matrix 


sequence 










simil 


simil 




Prolamin-boxoryza sativa 


145 


+ 


1 . 000 


' 0 . 913 


tgaAAAGc 


Prolamin-boxOryza sativa 


278 




1 . 000 


0 . 913 


tgaAAAGc 


Function : 


cis-acting regulatory element associated with 


GCN4 




TATA - box 












Site Name 


Organism 


Position 


Strand 


Core 


Matrix 


sequence 










simil . 


simil 




TATA -box 


Daucus carota 


79 




1.000 


1 . 000 


TATAaatt 


TATA- box 


Brassica juncea 


80 




1.000 


1 . 000 


TATAaat 


TATA -box 


Helianthus 


81 




1 . 000 


1 . 000 


TATAaa 




annuus 












TATA- box 


Brassica 


82 


+ 


1.000 


0.908 


tTATAac 




oleracea 












TATA -box 


Brassica napus 


83 




1 .000 


0 .892 


gtTATA 


TATA -box 


Oryza sativa 


117 


+ 


0 .818 


0 . 912 


TAGAaaa 


TATA -box 


Oryza sativa 


169 


+ 


0 .818 


0 . 872 


TAAAaac 


TATA -box 


Zea mays 


248 


+ 


0 . 909 


0 . 879 


TTTAgaaa 



Figure 8 (cont'd) 
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TATA -box 


Oryza sativa 


250 


+ 


0 


818 


0 


912 


TAGAaaa 


TATA -box 


Oryza sativa 


302 


+ 


0 


818 


0 


912 


TAAAaaa 


TATA -box 


Oryza sativa 


325 


+ 


0 


818 


0 


912 


TAAAaaa 


TATA -box 


Daucus carota 


364 




1 


000 


0 


863 


TATAactt 


TATA -box 


Brassica juncea 


365 




1 


000 


0 


857 


TATAact 


TATA- box 


Zea mays 


366 




1 


000 


0 


879 


TATAtaac 


TATA -box 


Oryza sativa 


367 




1 


000 


0 


956 


TATAtaa 


TATA -box 


Oryza sativa 


368 


+ 


1 


000 


0 


929 


TATAtat 


TATA -box 


Oryza sativa 


369 




1 


000 


0 


929 


TATAtat 



TATA -box 


Solanum 


370 


1.000 


1 


000 


TATA t a 




tuberosum 












TATA -box 


Glycine max 


372 


+ 1.000 


0 


891 


TATAtt 


TATA -box 


Oryza sativa 


407 


0.818 


0 


872 


TAAAaag 


TATA -box 


Zea mays 


413 


0.909 


0 


879 


TTTAaaat 


TATA -box 


Zea mays 


442 


+ 0.909 


0 


879 


TTTAaaat 


Function : 


core promoter 


element around 


-30 of transcription 


start 





TC-rich_repeats 



Site Name 


Organism 


Position 


Strand 


Core 


Matrix 


sequence 










simil . 


simil 




TC-rich_repeats 


NT 


7 




1 . 000 


0 . 952 


gTTTTcttca 


TC-rich_repeats 


NT 


68 


+ 


1. 000 


1. 000 


aTTTTcttca 


TC-rich_repeats 


NT 


152 




1. 000 


0 . 909 


gTTTTcttcg 


TC-rich_repeats 


NT 


191 




1 . 000 


0. 885 


tTTTTcttga 


TC-rich_repeats 


NT 


248 




1 . 000 


0 . 914 


tTTTTctaaa 


TC-rich_repeats 


NT 


285 




1 . 000 


0 . 909 


gTTTTcttcg 


TC-rich_repeats 


NT 


346 


+ 


1 . 00D 


0 . 915 


aTTTTcttgg 



Function : 



WUN-motif 



Site Name Organism Position Strand Core Matrix sequence 

simil. simil 



WUN-motif 
oleracea 



Brassica 



18 



1.000 0.948 tCATTacat 



WUN-motif 
oleracea 



Brassica 



139 



1:000 1.000 tCATTacca 



WUN-motif 
oleracea 



Brassica 



237 



0.857 0.94 8 tTATTtcga 



WUN-motif 
oleracea 



Brassica 



242 



1.000 1.000 aAATTtcga 



WUN-motif 
oleracea 



Brassica 



272 



1.000 0.948 tCATTacta 



WUN-motif Brassica 296 - 0.857 0.948 tTATTacgt 

oleracea 

Function: wound -responsive element 



Figure 8 (cont'd) 



